<110> APPLICANT: Alam, Maqsudul 
Larsen, Randy 

<120> TITLE OF INVENTION: HEME PROTEINS HEMAT-HS AND HEMAT-BS AND THEIR USE IN MEDICINE AND MICROSENSORS 
<130> FILE REFERENCE: 201040/1020 
<140> CURRENT APPLICATION NUMBER: 09/455,978B 
<141> CURRENT FILING DATE: 1999-12-06 
<160> NUMBER OF SEQ ID NOS: 86 
<170> SOFTWARE: PatentIn Ver. 2.1 
<210> SEQ ID NO: 1 
<211> LENGTH: 1470 
<212> TYPE: DNA 

<213> ORGANISM: Halobacterium salinarum 
<400> SEQUENCE: 1 

atgagcaacg ataatgacac tctcgtgacc gccgacgttc ggaacgggat cgacgggcac 
gcactcgcgg accggatcgg cctcgacgag gcggagatcg cgtggcggct gtcgttcacc 
gggatcgacg acgacacgat ggccgcgctc gccgccgaac agccgctgtt cgaagccacc 
gcggacgcgc tggtgaccga cttctacgac cacttggagt cctacgagcg cacacaggac 
ctcttcgcga actccacgaa gaccgtcgag caactcaaag agacgcaggc cgagtacttg 
ctgggcctcg ggcgcggcga gtacgacacc gagtacgccg cccagcgcgc ccgtatcggg 
aagatacacg acgtgctcgg gctcggaccg gacgtctatc tgggcgcgta cacgcgatac 
tacacggggc tgttggacgc gcttgccgac gacgtggtcg ccgaccgcgg cgaggaggcg 
gccgccgccg tcgacgaact cgtggcccgg ttcctgccga tgttgaagct gttgaccttc 
gatcagcaga tcgcaatgga cacctacatc gactcgtacg cccagcgcct ccacgacgag 
atcgacagcc gccaggagtt ggcgaacgcg gtcgccacgc acgtggaagc accgctgtcc 
tcgctggagg cgacctcgca ggacgtcgcc gagcgcacgg acacgatgcg ggcccgcacc 
gacgaccagg tcgaccgcat ggctgacgtc agccgtgaga tatccagcgt gtccgcgagc 
gtcgaggagg tcgcctcgac ggccgacgac gtccgccgga ccagcgagga cgccgaggcg 
ctggcccagc agggcgaggc ggccgccgac gacgcgctcg ccacgatgac cgacatcgac 
gaggcgaccg acggcgtcac cgcgggcgtc gaacagctcg gcgagcgcgc cgccgacgtc 
gaatcagtga ccggcgtgat cgacgacatc gccgagcaga cgaacatgct ggcgctgaac 
gcgtccatcg aggccgcccg cgccggggag gcgggcgagg ggtttgcggt cgtcgccgac 

42 gaggtcaagg ccctcgccga ggagtcccgc gagcagtcca cgcgcgtcga ggagctcgtc 
43 gagcagatgc aggcggagac cgaggagacg gtcgaccagt tggacgaggt caaccagcgc 
44 atcggcgagg gcgtcgagcg cgtcgaggag gcgatggaga ccctccagga gatcaccgac 
45 gccgtcgagg acgccgcaag cgggatgcag gaggtgtcga cggcgaccga cgaacaggcg 
46 gtgagcaccg aggaggtcgc cgagatggtc gacggtgtcg acgaccgcgc gggcgagatc 
47 gcggccgccc tcgatgacat cgccgacgcg accgatcagc aggtccggac cgtcgaggag 
48 gtccgcgaga cggtcggcaa gctcagctag 

51 <210> SEQ ID NO: 2 

52 <211> LENGTH: 489 

53 <212> TYPE: PRT 

54 <213> ORGANISM: Halobacterium salinarum 
56 <400> SEQUENCE: 2 

57 Met Ser Asn Asp Asn Asp Thr Leu Val Thr Ala Asp Val Arg Asn Gly 
58 1 5 10 15 

60 Ile Asp Gly His Ala Leu Ala Asp Arg Ile Gly Leu Asp Glu Ala Glu 
61 20 25 30 

63 Ile Ala Trp Arg Leu Ser Phe Thr Gly Ile Asp Asp Asp Thr Met Ala 
64 35 40 45 

66 Ala Leu Ala Ala Glu Gln Pro Leu Phe Glu Ala Thr Ala Asp Ala Leu 
67 50 55 60 

69 Val Thr Asp Phe Tyr Asp His Leu Glu Ser Tyr Glu Arg Thr Gln Asp 
70 65 70 75 80 

72 Leu Phe Ala Asn Ser Thr Lys Thr Val Glu Gln Leu Lys Glu Thr Gln 
73 85 90 95 

75 Ala Glu Tyr Leu Leu Gly Leu Gly Arg Gly Glu Tyr Asp Thr Glu Tyr 
76 100 105 110 

78 Ala Ala Gln Arg Ala Arg Ile Gly Lys Ile His Asp Val Leu Gly Leu 
79 115 120 125 

81 Gly Pro Asp Val Tyr Leu Gly Ala Tyr Thr Arg Tyr Tyr Thr Gly Leu 
82 130 135 140 

84 Leu Asp Ala Leu Ala Asp Asp Val Val Ala Asp Arg Gly Glu Glu Ala 
85 145 150 155 160 

87 Ala Ala Ala Val Asp Glu Leu Val Ala Arg Phe Leu Pro Met Leu Lys 
88 165 170 175 

90 Leu Leu Thr Phe Asp Gln Gln Ile Ala Met Asp Thr Tyr Ile Asp Ser 
91 180 185 190 

93 Tyr Ala Gln Arg Leu His Asp Glu Ile Asp Ser Arg Gln Glu Leu Ala 
94 195 200 205 

96 Asn Ala Val Ala Thr His Val Glu Ala Pro Leu Ser Ser Leu Glu Ala 
97 210 215 220 

99 Thr Ser Gln Asp Val Ala Glu Arg Thr Asp Thr Met Arg Ala Arg Thr 
100 225 230 235 240 

102 Asp Asp Gln Val Asp Arg Met Ala Asp Val Ser Arg Glu Ile Ser Ser 
103 245 250 255 

105 Val Ser Ala Ser Val Glu Glu Val Ala Ser Thr Ala Asp Asp Val Arg 
106 260 265 270 

108 Arg Thr Ser Glu Asp Ala Glu Ala Leu Ala Gln Gln Gly Glu Ala Ala 
109 275 280 285 

111 Ala Asp Asp Ala Leu Ala Thr Met Thr Asp Ile Asp Glu Ala Thr Asp 
112 290 295 300 

114 Gly Val Thr Ala Gly Val Glu Gln Leu Gly Glu Arg Ala Ala Asp Val 
115 305 310 315 320 

117 Glu Ser Val Thr Gly Val Ile Asp Asp Ile Ala Glu Gln Thr Asn Met 
118 325 330 335 

120 Leu Ala Leu Asn Ala Ser Ile Glu Ala Ala Arg Ala Gly Glu Ala Gly 
121 340 345 350 

123 Glu Gly Phe Ala Val Val Ala Asp Glu Val Lys Ala Leu Ala Glu Glu 
124 355 360 365 

126 Ser Arg Glu Gln Ser Thr Arg Val Glu Glu Leu Val Glu Gln Met Gln 
127 370 375 380 

129 Ala Glu Thr Glu Glu Thr Val Asp Gln Leu Asp Glu Val Asn Gln Arg 
130 385 390 395 400 

132 Ile Gly Glu Gly Val Glu Arg Val Glu Glu Ala Met Glu Thr Leu Gln 
133 405 410 415 

135 Glu Ile Thr Asp Ala Val Glu Asp Ala Ala Ser Gly Met Gln Glu Val 
136 420 425 430 

138 Ser Thr Ala Thr Asp Glu Gln Ala Val Ser Thr Glu Glu Val Ala Glu 
139 435 440 445 

141 Met Val Asp Gly Val Asp Asp Arg Ala Gly Glu Ile Ala Ala Ala Leu 
142 450 455 460 

144 Asp Asp Ile Ala Asp Ala Thr Asp Gln Gln Val Arg Thr Val Glu Glu 
145 465 470 475 480 

147 Val Arg Glu Thr Val Gly Lys Leu Ser 
148 485 

151 <210> SEQ ID NO: 3 
152 <211> LENGTH: 1390 
153 <212> TYPE: DNA 
154 <213> ORGANISM: Bacillus subtilis 

156 <400> SEQUENCE: 3 

157 atgttattta aaaaagacag aaaacaagaa acagcttact tttcagattc aaacggacaa 60 
158 caaaaaaacc gcattcagct cacaaacaaa catgcagatg tcaaaaaaca gctcaaaatg 120 
159 gtcaggttgg gagatgctga gctttatgtg ttagagcagc ttcagccact cattcaagaa 180 
160 aatatcgtaa atatcgtcga tgcgttttat aaaaaccttg accatgaaag ctcattgatg 240 
161 gatatcatta atgatcacag ctcagttgac cgcttaaaac aaacgttaaa acggcatatt 300 
162 caggaaatgt ttgcaggcgt tatcgatgat gaatttattg aaaagcgtaa ccgaatcgcc 360 
163 tccatccatt taagaatcgg ccttttgcca aaatggtata tgggtgcgtt tcaagagctc 420 
164 cttttgtcaa tgattgacat ttatgaagcg tccattacaa atcagcaaga actgctaaaa 480 
165 gccattaaag caacaacaaa aatcttgaac ttagaacagc agcttgtcct tgaagcgttt 540 
166 caaagcgagt acaaccagac ccgtgatgaa caagaagaaa agaaaaacct tcttcatcag 600 
167 aaaattcaag aaacctctgg atcgattgcc attctgtttt cagaaacaag cagatcagtt 660 
168 caagagcttg tggacaaatc tgaaggcatt tctcaagcat ccaaagccgg cactgtaaca 720 
169 tccagcactg ttgaagaaaa gtcgatcggc ggaaaaaaag agctagaagt ccagcaaaaa 780 
170 cagatgaaca aaattgacac aagccttgtc caaatcgaaa aagaaatggt caagctggat 840 
171 gaaatcgcgc agcaaattga aaaaatcttc ggcatcgtca caggcatagc tgaacaaaca 900 
172 aaccttctct cgctcaatgc atctattgaa tccgcccgcg ccggagaaca cggcaaaggc 960 
173 tttgctgtcg tggcaaatga agtgcggaag ctttctgagg atacgaaaaa aaccgtctct 1020 
174 actgtttctg agcttgtgaa caatacgaat acacaaatca acattgtatc caagcatatc 1080 
175 aaagacgtga atgagctagt cagcgaaagt aaagaaaaaa tgacgcaaat taaccgctta 1140 
176 ttcgatgaaa tcgtccacag catgaaaatc agcaaagagc aatcaggcaa aatcgacgtc 1200 
177 gatctgcaag cctttcttgg agggcttcag gaagtcagcc gcgccgtttc ccatgtggcc 1260 
178 gcttccgttg attcgcttgt catcctgaca gaagaataac catcaaaaac cggtctgcca 1320 
179 tacggccggt ttttttgcgt tcattatgta aacttaaatt aaaaatcagt tgacataata 1380 
180 attacctgca 1390 

183 <210> SEQ ID NO: 4 
184 <211> LENGTH: 432 
185 <212> TYPE: PRT 
186 <213> ORGANISM: Bacillus subtilis 

188 <400> SEQUENCE: 4 

189 Met Leu Phe Lys Lys Asp Arg Lys Gln Glu Thr Ala Tyr Phe Ser Asp 
190 1 5 10 15 

192 Ser Asn Gly Gln Gln Lys Asn Arg Ile Gln Leu Thr Asn Lys His Ala 
193 20 25 30 

195 Asp Val Lys Lys Gln Leu Lys Met Val Arg Leu Gly Asp Ala Glu Leu 
196 35 40 45 

198 Tyr Val Leu Glu Gln Leu Gln Pro Leu Ile Gln Glu Asn Ile Val Asn 
199 50 55 60 

201 Ile Val Asp Ala Phe Tyr Lys Asn Leu Asp His Glu Ser Ser Leu Met 
202 65 70 75 80 

204 Asp Ile Ile Asn Asp His Ser Ser Val Asp Arg Leu Lys Gln Thr Leu 
205 85 90 95 

207 Lys Arg His Ile Gln Glu Met Phe Ala Gly Val Ile Asp Asp Glu Phe 
208 100 105 110 

210 Ile Glu Lys Arg Asn Arg Ile Ala Ser Ile His Leu Arg Ile Gly Leu 
211 115 120 125 

213 Leu Pro Lys Trp Tyr Met Gly Ala Phe Gln Glu Leu Leu Leu Ser Met 
214 130 135 140 

216 Ile Asp Ile Tyr Glu Ala Ser Ile Thr Asn Gln Gln Glu Leu Leu Lys 
217 145 150 155 160 

219 Ala Ile Lys Ala Thr Thr Lys Ile Leu Asn Leu Glu Gln Gln Leu Val 
220 165 170 175 

222 Leu Glu Ala Phe Gln Ser Glu Tyr Asn Gln Thr Arg Asp Glu Gln Glu 
223 180 185 190 

225 Glu Lys Lys Asn Leu Leu His Gln Lys Ile Gln Glu Thr Ser Gly Ser 
226 195 200 205 

228 Ile Ala Asn Leu Phe Ser Glu Thr Ser Arg Ser Val Gln Glu Leu Val 
229 210 215 220 

231 Asp Lys Ser Glu Gly Ile Ser Gln Ala Ser Lys Ala Gly Thr Val Thr 
232 225 230 235 240 

234 Ser Ser Thr Val Glu Glu Lys Ser Ile Gly Gly Lys Lys Glu Leu Glu 
235 245 250 255 

237 Val Gln Gln Lys Gln Met Asn Lys Ile Asp Thr Ser Leu Val Gln Ile 
238 260 265 270 

240 Glu Lys Glu Met Val Lys Leu Asp Glu Ile Ala Gln Gln Ile Glu Lys 
241 275 280 285 

243 Ile Phe Gly Ile Val Thr Gly Ile Ala Glu Gln Thr Asn Leu Leu Ser 
244 290 295 300 

246 Leu Asn Ala Ser Ile Glu Ser Ala Arg Ala Gly Glu His Gly Lys Gly 
247 305 310 315 320 

249 Phe Ala Val Val Ala Asn Glu Val Arg Lys Leu Ser Glu Asp Thr Lys 
250 325 330 335 

252 Lys Thr Val Ser Thr Val Ser Glu Leu Val Asn Asn Thr Asn Thr Gln 
253 340 345 350 

255 Ile Asn Ile Val Ser Lys His Ile Lys Asp Val Asn Glu Leu Val Ser 
256 355 360 365 

258 Glu Ser Lys Glu Lys Met Thr Gln Ile Asn Arg Leu Phe Asp Glu Ile 
259 370 375 380 

261 Val His Ser Met Lys Ile Ser Lys Glu Gln Ser Gly Lys Ile Asp Val 
262 385 390 395 400 

264 Asp Leu Gln Ala Phe Leu Gly Gly Leu Gln Glu Val Ser Arg Ala Val 
265 405 410 415 

267 Ser His Val Ala Ala Ser Val Asp Ser Leu Val Ile Leu Thr Glu Glu 
268 420 425 430 

274 <210> SEQ ID NO: 5 
275 <211> LENGTH: 57 
276 <212> TYPE: PRT 
277 <213> ORGANISM: Artificial Sequence 

279 <220> FEATURE: 
280 <223> OTHER INFORMATION: Description of Artificial Sequence: Template 
281 sequence 

283 <220> FEATURE: 
284 <221> NAME/KEY: UNSURE 
285 <222> LOCATION: (4)..(57) 
286 <223> OTHER INFORMATION: Xaa at positions 4, 10, 14, 15, 27, and 41-57 is 
287 unknown 

289 <400> SEQUENCE: 5 

W--> 290 Ile Ile Lys Xaa Thr Val Pro Val Leu Xaa Glu His Gly Xaa Xaa Ile 
291 1 5 10 15 

W--> 293 Gly Gln Asp Val Leu Val Val Leu Ile Lys Xaa Asn Pro Glu Ile Gln 
294 20 25 30 

W--> 296 Glu Lys Phe Phe Phe Phe Lys His Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
297 35 40 45 

W--> 299 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
300 50 55 

303 <210> SEQ ID NO: 6 
304 <211> LENGTH: 55 
305 <212> TYPE: PRT 
306 <213> ORGANISM: Erwinia chrysanthemi 

308 <400> SEQUENCE: 6 

309 Ile Lys Ser Thr Ile Pro Leu Leu Ala Glu Thr Gly Pro Ala Leu Thr 
310 1 5 10 15 

312 Ala His Phe Tyr Gln Arg Met Phe His His Asn Pro Glu Leu Lys Asp 
313 20 25 30 

315 Ile Phe Asn Met Ser Asn Gln Arg Asn Gly Asp Gln Arg Glu Ala Leu 
316 35 40 45 

318 Phe Asn Ala Ile Cys Ala Tyr 
319 50 55 

322 <210> SEQ ID NO: 7 
323 <211> LENGTH: 56 
324 <212> TYPE: PRT 
325 <213> ORGANISM: Vitreoscilla stercoraria 

327 <400> SEQUENCE: 7 

328 Ile Ile Lys Ala Thr Val Pro Val Leu Lys Glu His Gly Val Thr Ile 
329 1 5 10 15 

331 Thr Thr Thr Phe Tyr Lys Asn Leu Phe Ala Lys His Pro Glu Val Arg 
332 20 25 30 

334 Pro Leu Phe Asp Met Gly Arg Gln Glu Ser Leu Glu Gln Pro Lys Ala 
335 35 40 45 



using n or Xaa. 
L:290 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:5 
L:293 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:5 
L:296 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:5 
L:299 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:5 
L:753 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:30 
L:756 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#: 
L:1062 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:50 
L:1701 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:82 
L:1704 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:82 
L:1723 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:83 
L:1745 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:84 
L:1764 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:85 
L:1786 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:86 